Testing homogeneity of a large data set by bootstrapping
نویسندگان
چکیده
It is not rare to analyze large data sets these days. Large data is usually of census type and is called the micro data in econometrics. The basic method of analysis is to estimate a single regression equation with common coefficients over the whole data. The same applies to other method of estimation such as the discrete choice models, Tobit models, and so on. Heterogeneity in the data is usually adjusted by the dummy variables. Dummy variables represent socioeconomic differences among individuals in the sample. Including the coefficients of dummy variables, only one equation is estimated for the whole large sample, and it is usually not preferred to divide the whole sample into sub-samples. Data is said to be homogenous in this paper if a single equation is fit to the whole data, and it explains socioeconomic properties of the data well. We may estimate an equation in each sub-population if the whole population is divided into known subpopulations. It is assumed that the coefficients are different from one sub-population to another in this case. Data is said to be heterogeneous in our paper. The analysis of variance is applied if sub-populations are known and sub-sample is collected from each subpopulation.
منابع مشابه
A New Robust Bootstrap Algorithm for the Assessment of Common Set of Weights in Performance Analysis
The performance of the units is defined as the ratio of the weighted sum of outputs to the weighted sum of inputs. These weights can be determined by data envelopment analysis (DEA) models. The inputs and outputs of the related (Decision Making Unit) DMU are assessed by a set of the weights obtained via DEA for each DMU. In addition, the weights are not generally common, but rather, they are ve...
متن کاملAn Integrated DEA and Data Mining Approach for Performance Assessment
This paper presents a data envelopment analysis (DEA) model combined with Bootstrapping to assess performance of one of the Data mining Algorithms. We applied a two-step process for performance productivity analysis of insurance branches within a case study. First, using a DEA model, the study analyzes the productivity of eighteen decision-making units (DMUs). Using a Malmquist index, DEA deter...
متن کاملWeighted tests of homogeneity for testing the number of components in a mixture
An important but di-cult problem in .tting .nite mixture models is estimating and testing the number of components in the mixture. Regularity conditions do not hold for large sample likelihood theory so that likelihood ratio tests cannot easily be implemented. However, a number of homogeneity tests have been developed to test for the presence of a mixture. Weighted versions of homogeneity tests...
متن کاملNonparametric Estimation and Testing in Panels of Intercorrelated Time Series
We consider nonparametric estimation and testing of linearity in a panel of intercorrelated time series. We place the emphasis on the situation where there are many time series in the panel but few observations for each of the series. The intercorrelation is described by a latent process, and a conditioning argument involving this process plays an important role in deriving the asymptotic theor...
متن کاملBootstrap Procedures for Testing Homogeneity Hypotheses
Before pooling data on effect sizes (a generic term for parameters of interest in the context of meta-analysis) from different studies, it is important to test for homogeneity of the effect sizes. A well known test for homogeneity is based on Cochran’s chisquare statistic. Our recent investigation showed that when the effect size of interest is a pairwise correlation, Cochran’s homogeneity test...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Mathematics and Computers in Simulation
دوره 78 شماره
صفحات -
تاریخ انتشار 2008